Overview
Brought to you by YData
Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 7684 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.2 MiB |
| Average record size in memory | 840.6 B |
Variable types
| DateTime | 3 |
|---|---|
| Numeric | 10 |
| Text | 6 |
| Categorical | 4 |
| Boolean | 1 |
customer_gender is highly overall correlated with customer_id and 2 other fields | High correlation |
customer_id is highly overall correlated with customer_gender and 2 other fields | High correlation |
email_domain is highly overall correlated with customer_gender and 2 other fields | High correlation |
gender_encoded is highly overall correlated with customer_gender and 2 other fields | High correlation |
high_value_item is highly overall correlated with item_price | High correlation |
item_id is highly overall correlated with order_id and 1 other fields | High correlation |
item_price is highly overall correlated with high_value_item and 2 other fields | High correlation |
item_unit_total is highly overall correlated with item_price and 1 other fields | High correlation |
order_id is highly overall correlated with item_id and 1 other fields | High correlation |
order_total is highly overall correlated with item_price and 1 other fields | High correlation |
product_id is highly overall correlated with item_id and 1 other fields | High correlation |
customer_gender is highly imbalanced (63.7%) | Imbalance |
email_domain is highly imbalanced (60.3%) | Imbalance |
gender_encoded is highly imbalanced (63.7%) | Imbalance |
item_qty_order is highly skewed (γ1 = 47.15205892) | Skewed |
item_id has unique values | Unique |
order_total has 418 (5.4%) zeros | Zeros |
total_qty_ordered has 202 (2.6%) zeros | Zeros |
item_price has 265 (3.4%) zeros | Zeros |
item_unit_total has 265 (3.4%) zeros | Zeros |
Reproduction
| Analysis started | 2025-06-06 18:58:06.617694 |
|---|---|
| Analysis finished | 2025-06-06 18:58:41.717020 |
| Duration | 35.1 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
order_created_at
Date
| Distinct | 4754 |
|---|---|
| Distinct (%) | 61.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.2 KiB |
| Minimum | 2016-11-23 12:12:16+00:00 |
|---|---|
| Maximum | 2021-08-29 08:54:01+00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
item_id
Real number (ℝ)
High correlation  Unique 
| Distinct | 7684 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102547.99 |
| Minimum | 13 |
|---|---|
| Maximum | 409358 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 636.15 |
| Q1 | 4173.75 |
| median | 10115.5 |
| Q3 | 23268.25 |
| 95-th percentile | 404200.85 |
| Maximum | 409358 |
| Range | 409345 |
| Interquartile range (IQR) | 19094.5 |
Descriptive statistics
| Standard deviation | 166690.06 |
|---|---|
| Coefficient of variation (CV) | 1.6254834 |
| Kurtosis | -0.57652369 |
| Mean | 102547.99 |
| Median Absolute Deviation (MAD) | 7454.5 |
| Skewness | 1.1893763 |
| Sum | 7.8797878 × 108 |
| Variance | 2.7785576 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 408776 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 404844 | 1 | < 0.1% |
| 404421 | 1 | < 0.1% |
| 404252 | 1 | < 0.1% |
| 404237 | 1 | < 0.1% |
| 404191 | 1 | < 0.1% |
| 403730 | 1 | < 0.1% |
| Other values (7674) | 7674 |
| Value | Count | Frequency (%) |
| 13 | 1 | |
| 14 | 1 | |
| 15 | 1 | |
| 41 | 1 | |
| 43 | 1 | |
| 44 | 1 | |
| 45 | 1 | |
| 47 | 1 | |
| 48 | 1 | |
| 53 | 1 |
| Value | Count | Frequency (%) |
| 409358 | 1 | |
| 409357 | 1 | |
| 409352 | 1 | |
| 409351 | 1 | |
| 409213 | 1 | |
| 409212 | 1 | |
| 409204 | 1 | |
| 409164 | 1 | |
| 409163 | 1 | |
| 409114 | 1 |
order_id
Real number (ℝ)
High correlation 
| Distinct | 4787 |
|---|---|
| Distinct (%) | 62.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56561.859 |
| Minimum | 0 |
|---|---|
| Maximum | 227752 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 341.75 |
| Q1 | 2538.75 |
| median | 6498.5 |
| Q3 | 16060 |
| 95-th percentile | 223736.85 |
| Maximum | 227752 |
| Range | 227752 |
| Interquartile range (IQR) | 13521.25 |
Descriptive statistics
| Standard deviation | 90899.775 |
|---|---|
| Coefficient of variation (CV) | 1.6070861 |
| Kurtosis | -0.56513151 |
| Mean | 56561.859 |
| Median Absolute Deviation (MAD) | 4945 |
| Skewness | 1.1908462 |
| Sum | 4.3462132 × 108 |
| Variance | 8.2627691 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 224889 | 89 | 1.2% |
| 1406 | 15 | 0.2% |
| 9829 | 15 | 0.2% |
| 16035 | 11 | 0.1% |
| 208011 | 11 | 0.1% |
| 224888 | 10 | 0.1% |
| 208007 | 10 | 0.1% |
| 1057 | 9 | 0.1% |
| 10153 | 9 | 0.1% |
| 14922 | 9 | 0.1% |
| Other values (4777) | 7496 |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 2 | < 0.1% |
| 25 | 3 | |
| 28 | 1 | < 0.1% |
| 31 | 5 | |
| 32 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 227752 | 2 | |
| 227747 | 2 | |
| 227635 | 2 | |
| 227627 | 1 | |
| 227593 | 2 | |
| 227568 | 1 | |
| 227550 | 2 | |
| 227543 | 1 | |
| 227486 | 1 | |
| 227401 | 1 |
order_number
Text
| Distinct | 4787 |
|---|---|
| Distinct (%) | 62.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 501.8 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 9.8551536 |
| Min length | 3 |
Unique
| Unique | 3038 ? |
|---|---|
| Unique (%) | 39.5% |
Sample
| 1st row | A1123000010463 |
|---|---|
| 2nd row | A1128000027180 |
| 3rd row | A1128000032136 |
| 4th row | A1128000037298 |
| 5th row | A1206000072983 |
| Value | Count | Frequency (%) |
| bob | 147 | 1.9% |
| b1bos011121-2 | 89 | 1.1% |
| bou04302 | 15 | 0.2% |
| bou011091793 | 15 | 0.2% |
| bou11770869 | 11 | 0.1% |
| bou05371738 | 11 | 0.1% |
| bos116908 | 10 | 0.1% |
| b1bos011121-1 | 10 | 0.1% |
| bou05390151 | 9 | 0.1% |
| bou03315 | 9 | 0.1% |
| Other values (4778) | 7505 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 12185 | |
| 0 | 9867 | |
| B | 8344 | |
| O | 7420 | |
| 2 | 6037 | |
| U | 5642 | |
| 3 | 4520 | 6.0% |
| 5 | 4075 | 5.4% |
| 9 | 4036 | 5.3% |
| 7 | 3640 | 4.8% |
| Other values (11) | 9961 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 75727 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 12185 | |
| 0 | 9867 | |
| B | 8344 | |
| O | 7420 | |
| 2 | 6037 | |
| U | 5642 | |
| 3 | 4520 | 6.0% |
| 5 | 4075 | 5.4% |
| 9 | 4036 | 5.3% |
| 7 | 3640 | 4.8% |
| Other values (11) | 9961 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 75727 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 12185 | |
| 0 | 9867 | |
| B | 8344 | |
| O | 7420 | |
| 2 | 6037 | |
| U | 5642 | |
| 3 | 4520 | 6.0% |
| 5 | 4075 | 5.4% |
| 9 | 4036 | 5.3% |
| 7 | 3640 | 4.8% |
| Other values (11) | 9961 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 75727 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 12185 | |
| 0 | 9867 | |
| B | 8344 | |
| O | 7420 | |
| 2 | 6037 | |
| U | 5642 | |
| 3 | 4520 | 6.0% |
| 5 | 4075 | 5.4% |
| 9 | 4036 | 5.3% |
| 7 | 3640 | 4.8% |
| Other values (11) | 9961 |
order_total
Real number (ℝ)
High correlation  Zeros 
| Distinct | 2198 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6288.4659 |
| Minimum | 0 |
|---|---|
| Maximum | 473800 |
| Zeros | 418 |
| Zeros (%) | 5.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 583 |
| median | 2425 |
| Q3 | 5845.35 |
| 95-th percentile | 21510 |
| Maximum | 473800 |
| Range | 473800 |
| Interquartile range (IQR) | 5262.35 |
Descriptive statistics
| Standard deviation | 20415.032 |
|---|---|
| Coefficient of variation (CV) | 3.2464249 |
| Kurtosis | 288.85813 |
| Mean | 6288.4659 |
| Median Absolute Deviation (MAD) | 2103.5 |
| Skewness | 14.741033 |
| Sum | 48320572 |
| Variance | 4.1677354 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 418 | 5.4% |
| 54119 | 89 | 1.2% |
| 2425 | 80 | 1.0% |
| 2450 | 56 | 0.7% |
| 22 | 36 | 0.5% |
| 16 | 32 | 0.4% |
| 200 | 30 | 0.4% |
| 3500 | 28 | 0.4% |
| 275 | 26 | 0.3% |
| 925 | 24 | 0.3% |
| Other values (2188) | 6865 |
| Value | Count | Frequency (%) |
| 0 | 418 | |
| 6.75 | 1 | < 0.1% |
| 11 | 5 | 0.1% |
| 12.38 | 1 | < 0.1% |
| 14.5 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 15.75 | 1 | < 0.1% |
| 16 | 32 | 0.4% |
| 17 | 13 | 0.2% |
| 19 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 473800 | 4 | |
| 423100 | 6 | |
| 196100 | 5 | |
| 193775 | 2 | < 0.1% |
| 159870 | 4 | |
| 154000 | 3 | |
| 137615 | 5 | |
| 132450 | 2 | < 0.1% |
| 126714 | 3 | |
| 122000 | 3 |
total_qty_ordered
Real number (ℝ)
Zeros 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3515096 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 202 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.5301918 |
|---|---|
| Coefficient of variation (CV) | 1.0759862 |
| Kurtosis | 369.58396 |
| Mean | 2.3515096 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 12.649151 |
| Sum | 18069 |
| Variance | 6.4018707 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3073 | |
| 2 | 1939 | |
| 3 | 1087 | 14.1% |
| 4 | 750 | 9.8% |
| 5 | 269 | 3.5% |
| 0 | 202 | 2.6% |
| 6 | 96 | 1.2% |
| 7 | 61 | 0.8% |
| 8 | 47 | 0.6% |
| 11 | 46 | 0.6% |
| Other values (10) | 114 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 202 | 2.6% |
| 1 | 3073 | |
| 2 | 1939 | |
| 3 | 1087 | 14.1% |
| 4 | 750 | 9.8% |
| 5 | 269 | 3.5% |
| 6 | 96 | 1.2% |
| 7 | 61 | 0.8% |
| 8 | 47 | 0.6% |
| 9 | 26 | 0.3% |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 60 | 2 | < 0.1% |
| 30 | 3 | < 0.1% |
| 21 | 6 | 0.1% |
| 20 | 2 | < 0.1% |
| 17 | 15 | 0.2% |
| 14 | 6 | 0.1% |
| 12 | 24 | |
| 11 | 46 | |
| 10 | 29 |
customer_id
Real number (ℝ)
High correlation 
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.581598 |
| Minimum | 4 |
|---|---|
| Maximum | 149 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 24 |
| median | 24 |
| Q3 | 41 |
| 95-th percentile | 67 |
| Maximum | 149 |
| Range | 145 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 16.332019 |
|---|---|
| Coefficient of variation (CV) | 0.53404727 |
| Kurtosis | 16.361913 |
| Mean | 30.581598 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.505884 |
| Sum | 234989 |
| Variance | 266.73485 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 5289 | |
| 41 | 1354 | 17.6% |
| 16 | 302 | 3.9% |
| 67 | 168 | 2.2% |
| 98 | 84 | 1.1% |
| 49 | 74 | 1.0% |
| 48 | 70 | 0.9% |
| 79 | 47 | 0.6% |
| 46 | 31 | 0.4% |
| 20 | 20 | 0.3% |
| Other values (44) | 245 | 3.2% |
| Value | Count | Frequency (%) |
| 4 | 16 | 0.2% |
| 5 | 5 | 0.1% |
| 6 | 11 | 0.1% |
| 7 | 3 | < 0.1% |
| 10 | 7 | 0.1% |
| 13 | 3 | < 0.1% |
| 14 | 7 | 0.1% |
| 16 | 302 | |
| 19 | 1 | < 0.1% |
| 20 | 20 | 0.3% |
| Value | Count | Frequency (%) |
| 149 | 7 | |
| 148 | 17 | |
| 140 | 2 | < 0.1% |
| 139 | 6 | 0.1% |
| 137 | 1 | < 0.1% |
| 134 | 12 | |
| 122 | 1 | < 0.1% |
| 111 | 1 | < 0.1% |
| 109 | 1 | < 0.1% |
| 106 | 12 |
customer_name
Text
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 516.9 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 12 |
| Mean length | 11.864133 |
| Min length | 9 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Shantae Guseo |
|---|---|
| 2nd row | Linn Finco |
| 3rd row | Joellen Best |
| 4th row | Linn Finco |
| 5th row | Chung Bolger |
| Value | Count | Frequency (%) |
| chung | 5289 | |
| bolger | 5289 | |
| rex | 1354 | 8.8% |
| ebonie | 1354 | 8.8% |
| cheryl | 302 | 2.0% |
| mcginnis | 302 | 2.0% |
| evonne | 168 | 1.1% |
| elenora | 168 | 1.1% |
| sherryl | 84 | 0.5% |
| socolow | 84 | 0.5% |
| Other values (76) | 974 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| g | 10892 | |
| e | 9496 | |
| n | 8407 | |
| 7684 | ||
| o | 7543 | |
| r | 6272 | 6.9% |
| l | 6214 | 6.8% |
| h | 5718 | 6.3% |
| C | 5658 | 6.2% |
| u | 5337 | 5.9% |
| Other values (35) | 17943 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 91164 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| g | 10892 | |
| e | 9496 | |
| n | 8407 | |
| 7684 | ||
| o | 7543 | |
| r | 6272 | 6.9% |
| l | 6214 | 6.8% |
| h | 5718 | 6.3% |
| C | 5658 | 6.2% |
| u | 5337 | 5.9% |
| Other values (35) | 17943 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 91164 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| g | 10892 | |
| e | 9496 | |
| n | 8407 | |
| 7684 | ||
| o | 7543 | |
| r | 6272 | 6.9% |
| l | 6214 | 6.8% |
| h | 5718 | 6.3% |
| C | 5658 | 6.2% |
| u | 5337 | 5.9% |
| Other values (35) | 17943 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 91164 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| g | 10892 | |
| e | 9496 | |
| n | 8407 | |
| 7684 | ||
| o | 7543 | |
| r | 6272 | 6.9% |
| l | 6214 | 6.8% |
| h | 5718 | 6.3% |
| C | 5658 | 6.2% |
| u | 5337 | 5.9% |
| Other values (35) | 17943 |
customer_gender
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 471.8 KiB |
| Female | |
|---|---|
| Male | 532 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.8615305 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Female | 7152 | |
| Male | 532 | 6.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 7152 | |
| male | 532 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14836 | |
| a | 7684 | |
| l | 7684 | |
| F | 7152 | |
| m | 7152 | |
| M | 532 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 45040 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 14836 | |
| a | 7684 | |
| l | 7684 | |
| F | 7152 | |
| m | 7152 | |
| M | 532 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 45040 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 14836 | |
| a | 7684 | |
| l | 7684 | |
| F | 7152 | |
| m | 7152 | |
| M | 532 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 45040 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 14836 | |
| a | 7684 | |
| l | 7684 | |
| F | 7152 | |
| m | 7152 | |
| M | 532 | 1.2% |
customer_email
Text
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 553.3 KiB |
Length
| Max length | 21 |
|---|---|
| Median length | 17 |
| Mean length | 16.712129 |
| Min length | 12 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | debest@verizon.net |
|---|---|
| 2nd row | shang@live.com |
| 3rd row | lbecchi@verizon.net |
| 4th row | shang@live.com |
| 5th row | hyper@outlook.com |
| Value | Count | Frequency (%) |
| hyper@outlook.com | 5289 | |
| lahvak@yahoo.com | 1354 | 17.6% |
| mugwump@att.net | 302 | 3.9% |
| dieman@verizon.net | 168 | 2.2% |
| hling@hotmail.com | 84 | 1.1% |
| jgwang@att.net | 74 | 1.0% |
| inico@live.com | 70 | 0.9% |
| gmcgath@att.net | 47 | 0.6% |
| kspiteri@me.com | 31 | 0.4% |
| shang@live.com | 20 | 0.3% |
| Other values (44) | 245 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 26393 | |
| h | 8406 | 6.5% |
| m | 8086 | 6.3% |
| . | 7684 | 6.0% |
| @ | 7684 | 6.0% |
| c | 7308 | 5.7% |
| t | 7155 | 5.6% |
| l | 7097 | 5.5% |
| y | 6762 | 5.3% |
| k | 6736 | 5.2% |
| Other values (17) | 35105 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 128416 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 26393 | |
| h | 8406 | 6.5% |
| m | 8086 | 6.3% |
| . | 7684 | 6.0% |
| @ | 7684 | 6.0% |
| c | 7308 | 5.7% |
| t | 7155 | 5.6% |
| l | 7097 | 5.5% |
| y | 6762 | 5.3% |
| k | 6736 | 5.2% |
| Other values (17) | 35105 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 128416 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 26393 | |
| h | 8406 | 6.5% |
| m | 8086 | 6.3% |
| . | 7684 | 6.0% |
| @ | 7684 | 6.0% |
| c | 7308 | 5.7% |
| t | 7155 | 5.6% |
| l | 7097 | 5.5% |
| y | 6762 | 5.3% |
| k | 6736 | 5.2% |
| Other values (17) | 35105 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 128416 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 26393 | |
| h | 8406 | 6.5% |
| m | 8086 | 6.3% |
| . | 7684 | 6.0% |
| @ | 7684 | 6.0% |
| c | 7308 | 5.7% |
| t | 7155 | 5.6% |
| l | 7097 | 5.5% |
| y | 6762 | 5.3% |
| k | 6736 | 5.2% |
| Other values (17) | 35105 |
product_id
Real number (ℝ)
High correlation 
| Distinct | 4296 |
|---|---|
| Distinct (%) | 55.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12706491 |
| Minimum | 53067 |
|---|---|
| Maximum | 1.6135317 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 53067 |
|---|---|
| 5-th percentile | 55661.55 |
| Q1 | 67710.75 |
| median | 125040 |
| Q3 | 215058 |
| 95-th percentile | 1.6105976 × 108 |
| Maximum | 1.6135317 × 108 |
| Range | 1.613001 × 108 |
| Interquartile range (IQR) | 147347.25 |
Descriptive statistics
| Standard deviation | 43171656 |
|---|---|
| Coefficient of variation (CV) | 3.3976065 |
| Kurtosis | 7.8973055 |
| Mean | 12706491 |
| Median Absolute Deviation (MAD) | 62786 |
| Skewness | 3.1456502 |
| Sum | 9.7636675 × 1010 |
| Variance | 1.8637919 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 86775 | 217 | 2.8% |
| 125040 | 130 | 1.7% |
| 84037 | 103 | 1.3% |
| 60971 | 74 | 1.0% |
| 63334 | 50 | 0.7% |
| 277139 | 34 | 0.4% |
| 105533 | 33 | 0.4% |
| 105666 | 31 | 0.4% |
| 60980 | 26 | 0.3% |
| 62449 | 26 | 0.3% |
| Other values (4286) | 6960 |
| Value | Count | Frequency (%) |
| 53067 | 3 | |
| 53071 | 3 | |
| 53072 | 1 | < 0.1% |
| 53073 | 1 | < 0.1% |
| 53076 | 2 | < 0.1% |
| 53081 | 5 | |
| 53082 | 1 | < 0.1% |
| 53089 | 6 | |
| 53092 | 1 | < 0.1% |
| 53093 | 4 |
| Value | Count | Frequency (%) |
| 161353169 | 1 | < 0.1% |
| 161353047 | 6 | 0.1% |
| 161352739 | 1 | < 0.1% |
| 161351820 | 1 | < 0.1% |
| 161351766 | 2 | < 0.1% |
| 161351699 | 2 | < 0.1% |
| 161351683 | 7 | |
| 161351579 | 16 | |
| 161351534 | 1 | < 0.1% |
| 161351414 | 1 | < 0.1% |
product_sku
Text
| Distinct | 4296 |
|---|---|
| Distinct (%) | 55.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 495.4 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 9 |
| Mean length | 9.0078084 |
| Min length | 8 |
Unique
| Unique | 3239 ? |
|---|---|
| Unique (%) | 42.2% |
Sample
| 1st row | 210768756 |
|---|---|
| 2nd row | 210759724 |
| 3rd row | 210810839 |
| 4th row | 210759761 |
| 5th row | 210763929 |
| Value | Count | Frequency (%) |
| 210173084 | 217 | 2.8% |
| 211532988 | 130 | 1.7% |
| 211230946 | 103 | 1.3% |
| 210942128 | 74 | 1.0% |
| 209630413 | 50 | 0.7% |
| 202545536 | 34 | 0.4% |
| 204770019 | 33 | 0.4% |
| 211104825 | 31 | 0.4% |
| 210942137 | 26 | 0.3% |
| 212232167 | 26 | 0.3% |
| Other values (4286) | 6960 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 14732 | |
| 1 | 12994 | |
| 0 | 7932 | |
| 7 | 5189 | 7.5% |
| 9 | 5014 | 7.2% |
| 8 | 4871 | 7.0% |
| 4 | 4720 | 6.8% |
| 3 | 4484 | 6.5% |
| 6 | 4450 | 6.4% |
| 5 | 4441 | 6.4% |
| Other values (17) | 389 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 69216 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 14732 | |
| 1 | 12994 | |
| 0 | 7932 | |
| 7 | 5189 | 7.5% |
| 9 | 5014 | 7.2% |
| 8 | 4871 | 7.0% |
| 4 | 4720 | 6.8% |
| 3 | 4484 | 6.5% |
| 6 | 4450 | 6.4% |
| 5 | 4441 | 6.4% |
| Other values (17) | 389 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 69216 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 14732 | |
| 1 | 12994 | |
| 0 | 7932 | |
| 7 | 5189 | 7.5% |
| 9 | 5014 | 7.2% |
| 8 | 4871 | 7.0% |
| 4 | 4720 | 6.8% |
| 3 | 4484 | 6.5% |
| 6 | 4450 | 6.4% |
| 5 | 4441 | 6.4% |
| Other values (17) | 389 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 69216 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 14732 | |
| 1 | 12994 | |
| 0 | 7932 | |
| 7 | 5189 | 7.5% |
| 9 | 5014 | 7.2% |
| 8 | 4871 | 7.0% |
| 4 | 4720 | 6.8% |
| 3 | 4484 | 6.5% |
| 6 | 4450 | 6.4% |
| 5 | 4441 | 6.4% |
| Other values (17) | 389 | 0.6% |
product_name
Text
| Distinct | 3480 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 719.4 KiB |
Length
| Max length | 107 |
|---|---|
| Median length | 79 |
| Mean length | 32.255726 |
| Min length | 7 |
Unique
| Unique | 2260 ? |
|---|---|
| Unique (%) | 29.4% |
Sample
| 1st row | 210768942 |
|---|---|
| 2nd row | Black Long Velvet Jacket |
| 3rd row | KEELY 100 SATIN BOW 100MM MULE:Light/Pastel Pink :39.5 - 210809529 |
| 4th row | Off-White Bateau Neck Long Tank Top |
| 5th row | Painted Blue Kimberly Plaid Shirt |
| Value | Count | Frequency (%) |
| black | 1150 | 3.1% |
| 896 | 2.4% | |
| bag | 487 | 1.3% |
| leather | 483 | 1.3% |
| gold | 455 | 1.2% |
| white | 453 | 1.2% |
| red | 424 | 1.2% |
| dress | 416 | 1.1% |
| set | 298 | 0.8% |
| pumps | 270 | 0.7% |
| Other values (3965) | 31334 |
Most occurring characters
| Value | Count | Frequency (%) |
| 28984 | 11.7% | |
| e | 22317 | 9.0% |
| a | 15733 | 6.3% |
| l | 13213 | 5.3% |
| r | 12674 | 5.1% |
| i | 11845 | 4.8% |
| t | 10521 | 4.2% |
| o | 10354 | 4.2% |
| n | 9019 | 3.6% |
| s | 7786 | 3.1% |
| Other values (109) | 105407 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 247853 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 28984 | 11.7% | |
| e | 22317 | 9.0% |
| a | 15733 | 6.3% |
| l | 13213 | 5.3% |
| r | 12674 | 5.1% |
| i | 11845 | 4.8% |
| t | 10521 | 4.2% |
| o | 10354 | 4.2% |
| n | 9019 | 3.6% |
| s | 7786 | 3.1% |
| Other values (109) | 105407 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 247853 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 28984 | 11.7% | |
| e | 22317 | 9.0% |
| a | 15733 | 6.3% |
| l | 13213 | 5.3% |
| r | 12674 | 5.1% |
| i | 11845 | 4.8% |
| t | 10521 | 4.2% |
| o | 10354 | 4.2% |
| n | 9019 | 3.6% |
| s | 7786 | 3.1% |
| Other values (109) | 105407 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 247853 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 28984 | 11.7% | |
| e | 22317 | 9.0% |
| a | 15733 | 6.3% |
| l | 13213 | 5.3% |
| r | 12674 | 5.1% |
| i | 11845 | 4.8% |
| t | 10521 | 4.2% |
| o | 10354 | 4.2% |
| n | 9019 | 3.6% |
| s | 7786 | 3.1% |
| Other values (109) | 105407 |
item_price
Real number (ℝ)
High correlation  Zeros 
| Distinct | 1419 |
|---|---|
| Distinct (%) | 18.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2220.2441 |
| Minimum | 0 |
|---|---|
| Maximum | 67000 |
| Zeros | 265 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 215.24 |
| median | 1047.62 |
| Q3 | 2571.43 |
| 95-th percentile | 7550 |
| Maximum | 67000 |
| Range | 67000 |
| Interquartile range (IQR) | 2356.19 |
Descriptive statistics
| Standard deviation | 4494.1955 |
|---|---|
| Coefficient of variation (CV) | 2.0241898 |
| Kurtosis | 61.116443 |
| Mean | 2220.2441 |
| Median Absolute Deviation (MAD) | 902.5 |
| Skewness | 6.7723654 |
| Sum | 17060356 |
| Variance | 20197793 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 265 | 3.4% |
| 16 | 209 | 2.7% |
| 925 | 141 | 1.8% |
| 1500 | 119 | 1.5% |
| 175 | 93 | 1.2% |
| 3000 | 72 | 0.9% |
| 200 | 65 | 0.8% |
| 2000 | 49 | 0.6% |
| 1850 | 49 | 0.6% |
| 1450 | 46 | 0.6% |
| Other values (1409) | 6576 |
| Value | Count | Frequency (%) |
| 0 | 265 | |
| 5.1 | 1 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| 9.29 | 1 | < 0.1% |
| 9.5 | 1 | < 0.1% |
| 9.75 | 1 | < 0.1% |
| 10.75 | 1 | < 0.1% |
| 12.25 | 1 | < 0.1% |
| 15.25 | 1 | < 0.1% |
| 16 | 209 |
| Value | Count | Frequency (%) |
| 67000 | 3 | |
| 57700 | 1 | < 0.1% |
| 48900 | 2 | < 0.1% |
| 48190.48 | 1 | < 0.1% |
| 47800 | 1 | < 0.1% |
| 45600 | 1 | < 0.1% |
| 45476.19 | 1 | < 0.1% |
| 45000 | 4 | |
| 44761.9 | 5 | |
| 43400 | 4 |
item_qty_order
Real number (ℝ)
Skewed 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1353462 |
| Minimum | 1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.5634958 |
|---|---|
| Coefficient of variation (CV) | 1.3771093 |
| Kurtosis | 2604.7246 |
| Mean | 1.1353462 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 47.152059 |
| Sum | 8724 |
| Variance | 2.4445191 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 7156 | |
| 2 | 374 | 4.9% |
| 3 | 84 | 1.1% |
| 4 | 37 | 0.5% |
| 5 | 12 | 0.2% |
| 6 | 6 | 0.1% |
| 7 | 5 | 0.1% |
| 8 | 4 | 0.1% |
| 60 | 2 | < 0.1% |
| 100 | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 7156 | |
| 2 | 374 | 4.9% |
| 3 | 84 | 1.1% |
| 4 | 37 | 0.5% |
| 5 | 12 | 0.2% |
| 6 | 6 | 0.1% |
| 7 | 5 | 0.1% |
| 8 | 4 | 0.1% |
| 10 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 60 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 8 | 4 | 0.1% |
| 7 | 5 | 0.1% |
| 6 | 6 | 0.1% |
| 5 | 12 | 0.2% |
| 4 | 37 |
item_unit_total
Real number (ℝ)
High correlation  Zeros 
| Distinct | 1558 |
|---|---|
| Distinct (%) | 20.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2414.6489 |
| Minimum | 0 |
|---|---|
| Maximum | 214285.7 |
| Zeros | 265 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 240 |
| median | 1100 |
| Q3 | 2666.67 |
| 95-th percentile | 8095.24 |
| Maximum | 214285.7 |
| Range | 214285.7 |
| Interquartile range (IQR) | 2426.67 |
Descriptive statistics
| Standard deviation | 6065.8839 |
|---|---|
| Coefficient of variation (CV) | 2.5121183 |
| Kurtosis | 299.88535 |
| Mean | 2414.6489 |
| Median Absolute Deviation (MAD) | 948 |
| Skewness | 13.282973 |
| Sum | 18554163 |
| Variance | 36794947 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 265 | 3.4% |
| 16 | 183 | 2.4% |
| 925 | 123 | 1.6% |
| 1500 | 103 | 1.3% |
| 3000 | 84 | 1.1% |
| 175 | 74 | 1.0% |
| 1850 | 58 | 0.8% |
| 2000 | 50 | 0.7% |
| 1000 | 49 | 0.6% |
| 200 | 44 | 0.6% |
| Other values (1548) | 6651 |
| Value | Count | Frequency (%) |
| 0 | 265 | |
| 5.1 | 1 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| 9.29 | 1 | < 0.1% |
| 9.5 | 1 | < 0.1% |
| 9.75 | 1 | < 0.1% |
| 10.75 | 1 | < 0.1% |
| 12.25 | 1 | < 0.1% |
| 15.25 | 1 | < 0.1% |
| 16 | 183 |
| Value | Count | Frequency (%) |
| 214285.7 | 1 | < 0.1% |
| 130200 | 1 | < 0.1% |
| 128571.42 | 1 | < 0.1% |
| 114750 | 1 | < 0.1% |
| 89523.8 | 2 | |
| 79047.62 | 2 | |
| 79000 | 1 | < 0.1% |
| 68380.96 | 2 | |
| 67043.49 | 1 | < 0.1% |
| 67000 | 3 |
order_date
Date
| Distinct | 811 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 300.3 KiB |
| Minimum | 2016-11-23 00:00:00 |
|---|---|
| Maximum | 2021-08-29 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
order_month
Date
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.2 KiB |
| Minimum | 2016-11-01 00:00:00 |
|---|---|
| Maximum | 2021-08-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
order_day_of_week
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 481.4 KiB |
| Tuesday | |
|---|---|
| Monday | |
| Thursday | |
| Wednesday | |
| Sunday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.1326132 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Wednesday |
|---|---|
| 2nd row | Monday |
| 3rd row | Monday |
| 4th row | Monday |
| 5th row | Tuesday |
Common Values
| Value | Count | Frequency (%) |
| Tuesday | 1799 | |
| Monday | 1437 | |
| Thursday | 1379 | |
| Wednesday | 1270 | |
| Sunday | 1206 | |
| Friday | 425 | 5.5% |
| Saturday | 168 | 2.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| tuesday | 1799 | |
| monday | 1437 | |
| thursday | 1379 | |
| wednesday | 1270 | |
| sunday | 1206 | |
| friday | 425 | 5.5% |
| saturday | 168 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 8954 | |
| a | 7852 | |
| y | 7684 | |
| u | 4552 | |
| s | 4448 | |
| e | 4339 | |
| n | 3913 | |
| T | 3178 | 5.8% |
| r | 1972 | 3.6% |
| o | 1437 | 2.6% |
| Other values (7) | 6478 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 54807 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 8954 | |
| a | 7852 | |
| y | 7684 | |
| u | 4552 | |
| s | 4448 | |
| e | 4339 | |
| n | 3913 | |
| T | 3178 | 5.8% |
| r | 1972 | 3.6% |
| o | 1437 | 2.6% |
| Other values (7) | 6478 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 54807 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 8954 | |
| a | 7852 | |
| y | 7684 | |
| u | 4552 | |
| s | 4448 | |
| e | 4339 | |
| n | 3913 | |
| T | 3178 | 5.8% |
| r | 1972 | 3.6% |
| o | 1437 | 2.6% |
| Other values (7) | 6478 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 54807 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 8954 | |
| a | 7852 | |
| y | 7684 | |
| u | 4552 | |
| s | 4448 | |
| e | 4339 | |
| n | 3913 | |
| T | 3178 | 5.8% |
| r | 1972 | 3.6% |
| o | 1437 | 2.6% |
| Other values (7) | 6478 |
order_hour
Real number (ℝ)
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.312858 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 8 |
| median | 10 |
| Q3 | 12 |
| 95-th percentile | 16 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.5092687 |
|---|---|
| Coefficient of variation (CV) | 0.34028091 |
| Kurtosis | 1.6268977 |
| Mean | 10.312858 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.93058078 |
| Sum | 79244 |
| Variance | 12.314967 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 913 | |
| 8 | 893 | |
| 11 | 855 | |
| 13 | 775 | |
| 10 | 771 | |
| 12 | 768 | |
| 9 | 699 | |
| 6 | 567 | |
| 14 | 500 | |
| 5 | 263 | 3.4% |
| Other values (13) | 680 |
| Value | Count | Frequency (%) |
| 0 | 5 | 0.1% |
| 1 | 4 | 0.1% |
| 3 | 17 | 0.2% |
| 4 | 39 | 0.5% |
| 5 | 263 | 3.4% |
| 6 | 567 | |
| 7 | 913 | |
| 8 | 893 | |
| 9 | 699 | |
| 10 | 771 |
| Value | Count | Frequency (%) |
| 23 | 46 | 0.6% |
| 22 | 114 | 1.5% |
| 21 | 30 | 0.4% |
| 20 | 40 | 0.5% |
| 19 | 12 | 0.2% |
| 18 | 15 | 0.2% |
| 17 | 74 | 1.0% |
| 16 | 72 | 0.9% |
| 15 | 212 | |
| 14 | 500 |
high_value_item
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 6163 | |
| True | 1521 | 19.8% |
email_domain
Categorical
High correlation  Imbalance 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 505.4 KiB |
| outlook.com | |
|---|---|
| yahoo.com | |
| att.net | 427 |
| verizon.net | 179 |
| hotmail.com | 116 |
| Other values (9) | 245 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10.338365 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | verizon.net |
|---|---|
| 2nd row | live.com |
| 3rd row | verizon.net |
| 4th row | live.com |
| 5th row | outlook.com |
Common Values
| Value | Count | Frequency (%) |
| outlook.com | 5307 | |
| yahoo.com | 1410 | 18.3% |
| att.net | 427 | 5.6% |
| verizon.net | 179 | 2.3% |
| hotmail.com | 116 | 1.5% |
| live.com | 94 | 1.2% |
| me.com | 41 | 0.5% |
| comcast.net | 28 | 0.4% |
| icloud.com | 22 | 0.3% |
| yahoo.ca | 19 | 0.2% |
| Other values (4) | 41 | 0.5% |
Length
| Value | Count | Frequency (%) |
| outlook.com | 5307 | |
| yahoo.com | 1410 | 18.3% |
| att.net | 427 | 5.6% |
| verizon.net | 179 | 2.3% |
| hotmail.com | 116 | 1.5% |
| live.com | 94 | 1.2% |
| me.com | 41 | 0.5% |
| comcast.net | 28 | 0.4% |
| icloud.com | 22 | 0.3% |
| yahoo.ca | 19 | 0.2% |
| Other values (4) | 41 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 26179 | |
| . | 7684 | 9.7% |
| m | 7193 | 9.1% |
| c | 7116 | 9.0% |
| t | 6986 | 8.8% |
| l | 5585 | 7.0% |
| u | 5329 | 6.7% |
| k | 5307 | 6.7% |
| a | 2042 | 2.6% |
| h | 1545 | 1.9% |
| Other values (12) | 4474 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 79440 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 26179 | |
| . | 7684 | 9.7% |
| m | 7193 | 9.1% |
| c | 7116 | 9.0% |
| t | 6986 | 8.8% |
| l | 5585 | 7.0% |
| u | 5329 | 6.7% |
| k | 5307 | 6.7% |
| a | 2042 | 2.6% |
| h | 1545 | 1.9% |
| Other values (12) | 4474 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 79440 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 26179 | |
| . | 7684 | 9.7% |
| m | 7193 | 9.1% |
| c | 7116 | 9.0% |
| t | 6986 | 8.8% |
| l | 5585 | 7.0% |
| u | 5329 | 6.7% |
| k | 5307 | 6.7% |
| a | 2042 | 2.6% |
| h | 1545 | 1.9% |
| Other values (12) | 4474 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 79440 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 26179 | |
| . | 7684 | 9.7% |
| m | 7193 | 9.1% |
| c | 7116 | 9.0% |
| t | 6986 | 8.8% |
| l | 5585 | 7.0% |
| u | 5329 | 6.7% |
| k | 5307 | 6.7% |
| a | 2042 | 2.6% |
| h | 1545 | 1.9% |
| Other values (12) | 4474 | 5.6% |
gender_encoded
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 435.4 KiB |
| 0 | |
|---|---|
| 1 | 532 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7152 | |
| 1 | 532 | 6.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 7152 | |
| 1 | 532 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7152 | |
| 1 | 532 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7684 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 7152 | |
| 1 | 532 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7684 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 7152 | |
| 1 | 532 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7684 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 7152 | |
| 1 | 532 | 6.9% |
order_week
Text
| Distinct | 225 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 585.4 KiB |
Length
| Max length | 21 |
|---|---|
| Median length | 21 |
| Mean length | 21 |
| Min length | 21 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2016-11-21/2016-11-27 |
|---|---|
| 2nd row | 2016-11-28/2016-12-04 |
| 3rd row | 2016-11-28/2016-12-04 |
| 4th row | 2016-11-28/2016-12-04 |
| 5th row | 2016-12-05/2016-12-11 |
| Value | Count | Frequency (%) |
| 2018-11-05/2018-11-11 | 358 | 4.7% |
| 2017-09-25/2017-10-01 | 321 | 4.2% |
| 2016-12-12/2016-12-18 | 292 | 3.8% |
| 2017-08-14/2017-08-20 | 259 | 3.4% |
| 2017-09-11/2017-09-17 | 222 | 2.9% |
| 2016-12-05/2016-12-11 | 188 | 2.4% |
| 2018-08-13/2018-08-19 | 180 | 2.3% |
| 2017-08-07/2017-08-13 | 179 | 2.3% |
| 2018-10-29/2018-11-04 | 162 | 2.1% |
| 2017-09-18/2017-09-24 | 143 | 1.9% |
| Other values (215) | 5380 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 35768 | |
| - | 30736 | |
| 1 | 28278 | |
| 2 | 26814 | |
| 7 | 7976 | 4.9% |
| / | 7684 | 4.8% |
| 8 | 6344 | 3.9% |
| 9 | 5665 | 3.5% |
| 6 | 3197 | 2.0% |
| 3 | 3115 | 1.9% |
| Other values (2) | 5787 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 161364 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35768 | |
| - | 30736 | |
| 1 | 28278 | |
| 2 | 26814 | |
| 7 | 7976 | 4.9% |
| / | 7684 | 4.8% |
| 8 | 6344 | 3.9% |
| 9 | 5665 | 3.5% |
| 6 | 3197 | 2.0% |
| 3 | 3115 | 1.9% |
| Other values (2) | 5787 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 161364 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35768 | |
| - | 30736 | |
| 1 | 28278 | |
| 2 | 26814 | |
| 7 | 7976 | 4.9% |
| / | 7684 | 4.8% |
| 8 | 6344 | 3.9% |
| 9 | 5665 | 3.5% |
| 6 | 3197 | 2.0% |
| 3 | 3115 | 1.9% |
| Other values (2) | 5787 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 161364 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 35768 | |
| - | 30736 | |
| 1 | 28278 | |
| 2 | 26814 | |
| 7 | 7976 | 4.9% |
| / | 7684 | 4.8% |
| 8 | 6344 | 3.9% |
| 9 | 5665 | 3.5% |
| 6 | 3197 | 2.0% |
| 3 | 3115 | 1.9% |
| Other values (2) | 5787 | 3.6% |
Interactions
Correlations
| customer_gender | customer_id | email_domain | gender_encoded | high_value_item | item_id | item_price | item_qty_order | item_unit_total | order_day_of_week | order_hour | order_id | order_total | product_id | total_qty_ordered | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| customer_gender | 1.000 | 0.887 | 0.830 | 0.999 | 0.000 | 0.142 | 0.045 | 0.020 | 0.000 | 0.112 | 0.118 | 0.163 | 0.000 | 0.131 | 0.048 |
| customer_id | 0.887 | 1.000 | 0.704 | 0.887 | 0.080 | 0.113 | 0.080 | 0.005 | 0.081 | 0.111 | -0.096 | 0.115 | 0.088 | 0.083 | -0.110 |
| email_domain | 0.830 | 0.704 | 1.000 | 0.830 | 0.107 | 0.372 | 0.019 | 0.164 | 0.000 | 0.152 | 0.103 | 0.277 | 0.085 | 0.192 | 0.144 |
| gender_encoded | 0.999 | 0.887 | 0.830 | 1.000 | 0.000 | 0.142 | 0.045 | 0.020 | 0.000 | 0.112 | 0.118 | 0.163 | 0.000 | 0.131 | 0.048 |
| high_value_item | 0.000 | 0.080 | 0.107 | 0.000 | 1.000 | 0.075 | 0.514 | 0.035 | 0.204 | 0.000 | 0.074 | 0.100 | 0.126 | 0.085 | 0.039 |
| item_id | 0.142 | 0.113 | 0.372 | 0.142 | 0.075 | 1.000 | 0.021 | -0.012 | 0.012 | 0.104 | 0.005 | 0.999 | 0.002 | 0.809 | -0.066 |
| item_price | 0.045 | 0.080 | 0.019 | 0.045 | 0.514 | 0.021 | 1.000 | -0.123 | 0.987 | 0.024 | -0.033 | 0.021 | 0.637 | 0.064 | 0.049 |
| item_qty_order | 0.020 | 0.005 | 0.164 | 0.020 | 0.035 | -0.012 | -0.123 | 1.000 | 0.011 | 0.008 | 0.018 | -0.012 | -0.010 | 0.007 | 0.211 |
| item_unit_total | 0.000 | 0.081 | 0.000 | 0.000 | 0.204 | 0.012 | 0.987 | 0.011 | 1.000 | 0.010 | -0.029 | 0.012 | 0.644 | 0.059 | 0.083 |
| order_day_of_week | 0.112 | 0.111 | 0.152 | 0.112 | 0.000 | 0.104 | 0.024 | 0.008 | 0.010 | 1.000 | 0.107 | 0.091 | 0.096 | 0.082 | 0.057 |
| order_hour | 0.118 | -0.096 | 0.103 | 0.118 | 0.074 | 0.005 | -0.033 | 0.018 | -0.029 | 0.107 | 1.000 | 0.005 | -0.046 | 0.001 | 0.067 |
| order_id | 0.163 | 0.115 | 0.277 | 0.163 | 0.100 | 0.999 | 0.021 | -0.012 | 0.012 | 0.091 | 0.005 | 1.000 | 0.001 | 0.808 | -0.066 |
| order_total | 0.000 | 0.088 | 0.085 | 0.000 | 0.126 | 0.002 | 0.637 | -0.010 | 0.644 | 0.096 | -0.046 | 0.001 | 1.000 | 0.002 | 0.388 |
| product_id | 0.131 | 0.083 | 0.192 | 0.131 | 0.085 | 0.809 | 0.064 | 0.007 | 0.059 | 0.082 | 0.001 | 0.808 | 0.002 | 1.000 | -0.091 |
| total_qty_ordered | 0.048 | -0.110 | 0.144 | 0.048 | 0.039 | -0.066 | 0.049 | 0.211 | 0.083 | 0.057 | 0.067 | -0.066 | 0.388 | -0.091 | 1.000 |
Missing values
Sample
| order_created_at | item_id | order_id | order_number | order_total | total_qty_ordered | customer_id | customer_name | customer_gender | customer_email | product_id | product_sku | product_name | item_price | item_qty_order | item_unit_total | order_date | order_month | order_day_of_week | order_hour | high_value_item | email_domain | gender_encoded | order_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2016-11-23 12:12:16+00:00 | 13 | 9 | A1123000010463 | 2705.00 | 1 | 14 | Shantae Guseo | Male | debest@verizon.net | 62231 | 210768756 | 210768942 | 2700.00 | 1 | 2700.00 | 2016-11-23 | 2016-11 | Wednesday | 12 | False | verizon.net | 1 | 2016-11-21/2016-11-27 |
| 1 | 2016-11-28 05:58:11+00:00 | 43 | 24 | A1128000027180 | 1725.00 | 1 | 20 | Linn Finco | Female | shang@live.com | 62502 | 210759724 | Black Long Velvet Jacket | 1720.00 | 1 | 1720.00 | 2016-11-28 | 2016-11 | Monday | 5 | False | live.com | 0 | 2016-11-28/2016-12-04 |
| 2 | 2016-11-28 06:57:27+00:00 | 53 | 28 | A1128000032136 | 2555.00 | 1 | 23 | Joellen Best | Male | lbecchi@verizon.net | 62627 | 210810839 | KEELY 100 SATIN BOW 100MM MULE:Light/Pastel Pink :39.5 - 210809529 | 2550.00 | 1 | 2550.00 | 2016-11-28 | 2016-11 | Monday | 6 | False | verizon.net | 1 | 2016-11-28/2016-12-04 |
| 3 | 2016-11-28 09:20:12+00:00 | 64 | 31 | A1128000037298 | 0.00 | 4 | 20 | Linn Finco | Female | shang@live.com | 53778 | 210759761 | Off-White Bateau Neck Long Tank Top | 0.00 | 1 | 0.00 | 2016-11-28 | 2016-11 | Monday | 9 | False | live.com | 0 | 2016-11-28/2016-12-04 |
| 4 | 2016-12-06 11:20:23+00:00 | 132 | 62 | A1206000072983 | 6315.00 | 3 | 24 | Chung Bolger | Female | hyper@outlook.com | 62362 | 210763929 | Painted Blue Kimberly Plaid Shirt | 1200.00 | 1 | 1200.00 | 2016-12-06 | 2016-12 | Tuesday | 11 | False | outlook.com | 0 | 2016-12-05/2016-12-11 |
| 5 | 2016-12-06 11:22:28+00:00 | 137 | 63 | A1206000073539 | 1605.00 | 1 | 24 | Chung Bolger | Female | hyper@outlook.com | 53361 | 210749594 | Off-White Agate And Marcasite Gemstone Necklace | 1600.00 | 1 | 1600.00 | 2016-12-06 | 2016-12 | Tuesday | 11 | False | outlook.com | 0 | 2016-12-05/2016-12-11 |
| 6 | 2016-12-06 14:29:05+00:00 | 149 | 70 | A1206000080688 | 1330.00 | 1 | 24 | Chung Bolger | Female | hyper@outlook.com | 62359 | 210763918 | White Lisbon Shirt | 1300.00 | 1 | 1300.00 | 2016-12-06 | 2016-12 | Tuesday | 14 | False | outlook.com | 0 | 2016-12-05/2016-12-11 |
| 7 | 2016-12-07 09:42:55+00:00 | 188 | 84 | A1207000094216 | 3350.00 | 1 | 4 | Jeramy Maurois | Female | dvdotnet@outlook.com | 62534 | 210761054 | Noir Rockstud Patent Pumps | 3320.00 | 1 | 3320.00 | 2016-12-07 | 2016-12 | Wednesday | 9 | True | outlook.com | 0 | 2016-12-05/2016-12-11 |
| 8 | 2016-12-08 14:35:23+00:00 | 211 | 95 | A1208000107360 | 5135.00 | 2 | 24 | Chung Bolger | Female | hyper@outlook.com | 56397 | 210806550 | 210806590 | 0.00 | 2 | 0.00 | 2016-12-08 | 2016-12 | Thursday | 14 | False | outlook.com | 0 | 2016-12-05/2016-12-11 |
| 9 | 2016-12-10 07:40:17+00:00 | 258 | 121 | BOU121710 | 6584.04 | 3 | 6 | Mia Garner | Female | dsowsy@icloud.com | 62586 | 210883460 | Portia 120 Metallic Raffia Wedges | 1854.52 | 1 | 1854.52 | 2016-12-10 | 2016-12 | Saturday | 7 | False | icloud.com | 0 | 2016-12-05/2016-12-11 |
| order_created_at | item_id | order_id | order_number | order_total | total_qty_ordered | customer_id | customer_name | customer_gender | customer_email | product_id | product_sku | product_name | item_price | item_qty_order | item_unit_total | order_date | order_month | order_day_of_week | order_hour | high_value_item | email_domain | gender_encoded | order_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7674 | 2021-01-21 08:37:00+00:00 | 405644 | 224889 | B1BOS011121-2 | 54119.0 | 0 | 41 | Rex Ebonie | Female | lahvak@yahoo.com | 179480 | 211961074 | Acrylic Stacked Obelisk | 1565.22 | 1 | 1565.22 | 2021-01-21 | 2021-01 | Thursday | 8 | False | yahoo.com | 0 | 2021-01-18/2021-01-24 |
| 7675 | 2021-01-21 08:37:00+00:00 | 405668 | 224889 | B1BOS011121-2 | 54119.0 | 0 | 41 | Rex Ebonie | Female | lahvak@yahoo.com | 161098651 | 213286547 | Regular Masha'Allah Box | 256.52 | 2 | 513.04 | 2021-01-21 | 2021-01 | Thursday | 8 | False | yahoo.com | 0 | 2021-01-18/2021-01-24 |
| 7676 | 2021-01-28 05:14:46+00:00 | 406043 | 225146 | B1BOS01852848 | 150.0 | 1 | 24 | Chung Bolger | Female | hyper@outlook.com | 161068543 | 212044994 | Herbal Deodorant Roll-On, 50ml | 109.52 | 1 | 109.52 | 2021-01-28 | 2021-01 | Thursday | 5 | False | outlook.com | 0 | 2021-01-25/2021-01-31 |
| 7677 | 2021-04-11 06:20:04+00:00 | 407773 | 226498 | B1BOS04711138 | 550.0 | 1 | 24 | Chung Bolger | Female | hyper@outlook.com | 202400 | 212232167 | Ark Small Bamboo Bag | 523.81 | 1 | 523.81 | 2021-04-11 | 2021-04 | Sunday | 6 | False | outlook.com | 0 | 2021-04-05/2021-04-11 |
| 7678 | 2021-04-20 06:31:33+00:00 | 407903 | 226597 | B1BOS0492078 | 5900.0 | 3 | 41 | Rex Ebonie | Female | lahvak@yahoo.com | 161100882 | 213298077 | ONE SHOULDER LONG SL | 213298077 | 2904.76 | 1 | 2904.76 | 2021-04-20 | 2021-04 | Tuesday | 6 | False | yahoo.com | 0 | 2021-04-19/2021-04-25 |
| 7679 | 2021-06-02 08:12:38+00:00 | 408589 | 227135 | B1BOS06330283 | 45.0 | 1 | 16 | Cheryl Mcginnis | Male | mugwump@att.net | 161018551 | 210340825 | Off-White Ballerina Step Kids No-Show Socks | 42.86 | 1 | 42.86 | 2021-06-02 | 2021-06 | Wednesday | 8 | False | att.net | 1 | 2021-05-31/2021-06-06 |
| 7680 | 2021-06-04 05:09:07+00:00 | 408693 | 227221 | B1BOS0650431 | 275.0 | 1 | 16 | Cheryl Mcginnis | Male | mugwump@att.net | 161352739 | 214213861 | COTTON COLLECTION - | 214213861 | 214.29 | 1 | 214.29 | 2021-06-04 | 2021-06 | Friday | 5 | False | att.net | 1 | 2021-05-31/2021-06-06 |
| 7681 | 2021-06-05 08:16:13+00:00 | 408716 | 227233 | B1BOS06110545 | 2850.0 | 11 | 16 | Cheryl Mcginnis | Male | mugwump@att.net | 161353047 | 214213924 | NAKED - DEMI UNDER W | 214213924 | 285.71 | 1 | 285.71 | 2021-06-05 | 2021-06 | Saturday | 8 | False | att.net | 1 | 2021-05-31/2021-06-06 |
| 7682 | 2021-06-06 06:26:37+00:00 | 408730 | 227244 | B1BOS06130611 | 2088.0 | 1 | 16 | Cheryl Mcginnis | Male | mugwump@att.net | 76419 | 211024340 | Black Maureen Leather Flats | 2209.52 | 1 | 2209.52 | 2021-06-06 | 2021-06 | Sunday | 6 | False | att.net | 1 | 2021-05-31/2021-06-06 |
| 7683 | 2021-06-06 07:14:24+00:00 | 408776 | 227283 | B1BOS06890692 | 414.0 | 2 | 24 | Chung Bolger | Female | hyper@outlook.com | 105587 | 207059517 | Skin Rescuer, 75ml | 171.43 | 1 | 171.43 | 2021-06-06 | 2021-06 | Sunday | 7 | False | outlook.com | 0 | 2021-05-31/2021-06-06 |